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Abstract 

We study the general (non-metric) facility-location and 
weighted fc-medians problems, as well as the fractional 
facility-location and fc-medians problems. We describe a 
natural randomized rounding scheme and use it to derive 
approximation algorithms for all of these problems. 

For facility location and weighted fc-medians, the re- 
spective algorithms are polynomial-time [H&k + d\- and 
[(1 + e)d;\n(n + n/e)k] -approximation algorithms. These 
performance guarantees improve on the best previous per- 
formance guarantees, due respectively to Hochbaum (1982) 
and Lin and Vitter (1992). For fractional fc-medians, the al- 
gorithm is a new, Lagrangian-relaxation, [(1 + e)d, (1 + e)k]- 
approximation algorithm. It runs in 0(fcln(n/e)/e 2 ) linear- 
time iterations. 

For fractional facilities-location (a generalization of frac- 
tional weighted set cover), the algorithm is a Lagrangian- 
relaxation, JYl + e)fc]-approximation algorithm. It runs in 
0(nln(n)/e ) linear-time iterations and is essentially the 
same as an unpublished Lagrangian-relaxation algorithm 
due to Garg (1998). By recasting his analysis probabilis- 
tically and abstracting it, we obtain an interesting (and as 
far as we know new) probabilistic bound that may be of 
independent interest. We call it the Chernoff-Wald bound. 

1 Problem definitions 

The input to the weighted set cover problem is 

a collection of sets, where each set s is given a cost 
cost(s) G IR + . The goal is to choose a cover (a collection 
of sets containing all elements) of minimum total cost. 

The (uncapacitated) facility-location problem is 
a generalization of weighted set cover in which each 
set / (called a "facility") and element c (called a 
"customer") are given a distance dist(/, c) £ IR + U{oo}. 
The goal is to choose a set of facilities F minimizing 
cost(_F) + dist(_F), where cost(.F), the facility cost of 
F, is X)feF C0St (/) an< l dist(-F), the assignment cost 
of F, is £ c min /eF dist(/, c). 

Fig. n shows the standard integer programming 
formulation of the problem — the facility-location IP 
\l'2i p. 8]. The facility-location linear program (LP) is 
the same except without the constraint u x(f) £ {0, 1}". 
A fractional solution is a feasible solution to the LP. 
Fractional facility location is the problem of solving the 
LP. 

^Research partially funded by NSF CAREER award CCR- 
9720664. Department of Computer Science, Dartmouth College, 
Hanover, NH. neyOcs.dartmouth.edu 
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Figure 1: The facility-location IP. Above d, k, x(f) and 
x(f,c) are variables, dist(x), the assignment cost of x, 
is z~2fc x (f> c )dist(/, c), and cost(x) 1 the facility cost of 
x, is J2f x if) cost (f)- 

The weighted fc-medians problem is the same 
as the facility location problem except for the following: 
a positive real number k is given as input, and the goal 
is to choose a subset F of facilities minimizing d±st(F) 
subject to the constraint cost(i 7 ') < k. The standard 
integer programming formulation is the k-medians IP, 
which differs from the IP in Fig.^only in that k is given, 
not a variable, and the objective function is d instead 
of d + k. The (unweighted) fc-medians problem is the 
special case when each cost(/) = 1. For the fractional 
fc-medians problem, the input is the same; the goal is 
to solve the linear program obtained by removing the 
constraint u x(f) £ {0, 1}" from the fc-medians IP. 

We take the "size" of each of the above problems to 
be the number of pairs (/, c) such that dist(/, c) < oo. 
We assume the size is at least the number of customers 
and facilities. 

By an [a(d) ; /3(fc)]-approximation algorithm for 
fc-medians, we mean an algorithm that, given a problem 
instance for which there exists a fractional solution of 
assignment cost d and facility cost fc, produces a solution 
of assignment cost at most a(d) and facility cost at most 
/3(fc). We use similar non-standard notations for facility 
location, set cover, and fc-set cover. For instance, by 
a [d + 2fc]-approximation algorithm for facility location, 
we mean an algorithm that, given an instance for which 
there exists a fractional solution of assignment cost d 
and facility cost fc, produces a solution for which the 
assignment cost plus the facility cost is at most d + 2k. 
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2 Background 

In the mid 1970's Johnson and Lovasz gave a greedy 
[i?A ^-approximation algorithm for unweighted set 
cover [51 In 1979 Chvatal generalized it to a [H&k]- 
approximation algorithm for weighted set cover 

In 1982 Hochbaum gave a greedy \H&(d + k)]- 
approximation algorithm for the uncapacitated facility- 
location problem by an implicit reduction to the 
weighted set-cover problem [S]. 1 Above A is at most 
the maximum, over all facilities /, of the number of 
customers c such that dist(/, c) < oo. 

In 1992 Lin and Vitter gave a polynomial-time 
[(1 + e)d; (1 + l/e)(lnn + l)fc]-approximation algorithm 
for the fc-medians problem ^UJ. (Here e > is an 
input parameter that tunes the tradeoff between the 
two criteria and n is the number of customers.) Their 
algorithm combines a greedy algorithm with a technique 
they call filtering. 

In 1994 Plotkin, Shmoys, and Tardos (PST) gave 
Lagrangian-relaxation algorithms for general packing 
and covering problems |13|. As a special case, their 
algorithms imply a [(1 + e)fc]-approximation algorithm 
for fractional set cover that runs in 0(fcln(n)/e 2 ) linear- 
time iterations. Here n is the number of elements. 

In 1998 Garg generalized and simplified the PST 
set cover algorithm to obtain a [(1 + e)fc]-approximation 
algorithm for fractional weighted set cover Garg's 
algorithm runs in 0(nln(n)/e 2 ) linear-time iterations. 
By Hochbaum's reduction, one can use Garg's algorithm 
as a [(1 + e)fc]-approximation algorithm for fractional 
facility location. The running time is the same, where 
n is the number of customers. 

Recent work has focused on metric fc-medians and 
facility location problems. In the metric versions, the 
distance function is assumed to satisfy the triangle in- 
equality. For example, [0(d + fc)]-approximation algo- 
rithms have recently been shown for the metric facility- 
location problem [7|E3- Charikar, Guha, Tardos and 
Shmoys [2] recently gave an [0(d); fc]-approximation al- 
gorithm for metric fc-medians. Many of these algorithm 
first solve the fractional problems and then round the 
fractional solutions. 



1 Hochbaum's reduction is easy to adapt in order to reduce 
unweighted fc-medians to a variant of set cover that we call fc- 
set cover. The reduction also extends naturally to the fractional 
problems. (See appendix for details.) This shows a loose 
equivalence between facility location and weighted set cover, 
and between fc-medians and fc-set cover. However, Hochbaum's 
reduction does not preserve the distinction between facility costs 
and assignment costs. For this reason, we work with the facility 
location and fc-medians representations. The algorithms and 
analyses we give for facility location easily imply corresponding 
results for weighted set cover, and the results for fc-medians are 
straightforward to adapt to fc-set cover. 



3 Results 

Fig. [3 shows the simple randomized rounding scheme 
at the center of all our results. With minor variations 
(e.g. Fig. |3J, this rounding scheme can be used as 
the basis for approximation algorithms for set cover, 
weighted set cover, facility location, and fc-medians, and 
as the basis for Lagrangian-relaxation algorithms for the 
fractional variants of these problems. 

Although essentially the same rounding scheme suf- 
fices for each of these problems, the respective proba- 
bilistic analyses require different (albeit standard) tech- 
niques in each case. For set cover, a simple direct 
analysis suffices |16| . For weighted set cover, facility 
location, and fc-medians, a basic probabilistic lemma 
called Wald's inequality is necessary. For fractional set 
cover and fc-medians, the analysis rests on the Chernoff 
bound. For fractional weighted set cover and facility lo- 
cation, the analysis is a simple application of what we 
call the Chernoff- Wald bound. 

For each problem, we apply the method of condi- 
tional probabilities to the rounding scheme in order to 
derive a corresponding approximation algorithm. The 
structure of each resulting algorithm, being closely tied 
to the underlying probabilistic analysis, ends up differ- 
ing substantially from problem to problem. 

For facility location, the resulting algorithm (Fig.|2J 
is a randomized rounding, polynomial-time [d + H&k]- 
approximation algorithm. The performance guarantee 
improves over Hochbaum's algorithm with respect to 
the assignment costs. 

For weighted fc-medians, the resulting algorithm 
(Fig. |3J) is a greedy [(l+e)d; ln(n+n/e)fc]-approximation 
algorithm. In comparison to Lin and Vitter's algorithm, 
the performance ratio with respect to the facility costs 
is better by a factor of roughly 1/e. 

For fractional fc-medians, the algorithm (Fig. [SJ 
is a [(1 + e)d, (1 + e)fc]-approximation algorithm. It 
is a Lagrangian-relaxation algorithm and runs in 
0(k ln(n/e)/e 2 ) linear-time iterations. This is a factor 
of n faster than the best bound we can show by applying 
the general algorithm of Plotkin, Shmoys and Tardos. 

Finally, for fractional facility location, the algo- 
rithm (see Fig. |SJ) is a [(1 + e)(d + fc)]-approximation, 
Lagrangian-relaxation algorithm. It runs in at most 
0(nln(n)/e 2 ) iterations, where each iteration requires 
time linear in the input size times Inn. This algorithm 
is the same as the unpublished fractional weighted set- 
cover algorithm due to Garg 

The main interest of this last result is not that 
we improve Garg's algorithm (we don't!), but that 
we recast and abstract Garg's analysis to obtain an 
apparently new (?) probabilistic bound — we call it the 
Chernoff- Wald bound — that may be of general interest 



3 



for probabilistic applications. 2 

A basic contribution of this work is to identify 
and abstract out (using the probabilistic method) com- 
mon techniques underlying the design and analysis of 
Lagrangian-relaxation and greedy approximation algo- 
rithms. 

4 Wald's inequality and Chernoff-Wald bound 

Before we state and prove Wald's inequality and the 
Chernoff-Wald bound, we give some simple examples. 
Suppose we perform repeated trials of a random ex- 
periment in which we roll a 6-sided die and flip a fair 
coin. We stop as soon as the total of the numbers 
rolled exceeds 3494. Let T be the number of trials. 
Let D < 3500 be the total of the numbers rolled. Let 
H be the number of flips that come up heads. Since the 
expectation of the number in each roll is 3.5, Wald's 
implies E[D] = 3.5E[T], which implies E[T] < 1000. 
Since the probability of a head in each coin flip is 0.5, 
Wald's implies E[H] =0.5E[T]. Thus we can conclude 
E[H] < 500. 

Now modify the experiment so that in each trial, 
each person in a group of 50 flips their own fair coin. Let 
M be the maximum number of heads any person gets. 
Then Chernoff-Wald states that E[M] < (1 + e)500 for 
e « 0.128 (so (l + e)500 « 564). 

If we were to modify the experiment so that the 
number of trials T was set at 1000, the Chernoff-Wald 
bound would give the same conclusion, but in that case 
the Chernoff bound would also imply (for the same e) 
that Pr[M > (1 + e)500] < 1 

Lemma 4.1. (Wald's inequality) Let T e IN + be a 

random variable with E[T] < oo and let Xi,X2, ... be a 
sequence of random variables. Let fi, c G IR . If 

E[X t \T>t}< n {for t > 1) 

and X t < c (for t — 1, . . . , T), then 

E{X t + X 2 + --- + X T ) < fiE(T). 

The claim also holds if each "< " is replaced by "> ". 

The condition u X t < c" is necessary. Consider 
choosing each X t randomly to be ±2* and letting T = 
min{i : X t > 0}. Then E[X t \T > t] = 0, so taking 
fi = 0, all conditions for the theorem except u X t < c" 



^The Chernoff- Wald bound also plays a central role in 
the randomized-rounding interpretation of the Garg and Kone- 
mann's recent multicommodity-fiow algorithm |5|. This and 
the general connection between randomized rounding and 
greedy/Lagrangian-relaxation algorithms are explored in depth 
in the journal version of II (il . which as of October 1999 is still 
being written. 



are met. But the conclusion E[X\ + X 2 + ■ ■ ■ + Xt] < 
does not hold, because X\ + X 2 + • • • + Xt = 1 ■ 

The proof is just an adaptation of the proof of 
Wald's equation ^ p. 370]. The reader can skip it on 
first reading. 

Proof. W.l.o.g. assume = 0, otherwise apply the 
change of variables X' t = Xt — [i before proceeding. 
Define Y t = Xi + X 2 + ■ ■ ■ + X t . If E(Y T ) = -oo then 
the claim clearly holds. Otherwise, 

oo 

E(Y T ) = ^ Pr(T = t)E(Y t \ T = t) 
t=i 

oo t 
t=l s=l 

The sum of the positive terms above is at most 
EtE s < t Pr(T = t)c = E f Pr(T = t)ct = E[cT] < oo. 
Thus, the double sum is absolutely convergent so 

OO CO 

= ££Pr(T = i)i?(X s |T = t) 

s— 1 t—s 

oo 

= ^¥i{T > s)E(X s \T > s) 

s=l 

This establishes the claim because E(X S \ T > s) < 0. 
The claim with "<" 's replaced by ">" 's follows via the 
change of variables X' t = —X t , fi' = —fi, and d == —c. 

□ 

In many applications of Wald's, the random vari- 
ables Xi,X 2l ... will be independent, and T will be 
a stopping time for {Xt} — a random variable in 
IN + such that the event "T = t" is independent of 
{X t +i, X t + 2 , . . .}. In this case the following companion 
lemma facilitates the application of Wald's inequality: 

Lemma 4.2. Let X\,X 2l ... be a sequence of indepen- 
dent random variables and let T be a stopping time for 
the sequence. Then E[X t | T > t] = £J[X t ]. 

Proof. Because X t is independent of X\, X2, ■ ■ ■ , Xt-i, 
and the event "T > t" (i.e. T £ {1, 2, . . . ,t - 1}) is 
determined by the values of Xi,X 2 , . . . , X t -i, it follows 
that X t is independent of the event "T > t" . rj 

Here is a statement of a standard Chernoff bound. 
For a proof see e.g. [mirp] . 
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Let ch(e) = (1 + e) ln(l + e) - e > eln(l + e) /2. 
For e < 1, ch(e) > e 2 / 3 and cn (- £ ) > e 2 / 2 - 

Lemma 4.3. (Chernoff Bound 

Lei -X"i, X2, . . . , Xfc &e a sequence of independent random 
variables in [0, 1] iot£/i E(^2 i Xi) > jj,> 0. Let e > 0. 
Tfcera Pr E l > A*(l + e)] < exp(-ch(e)/x). 
For e < 1, Pr [£. X, < fi(l - e)] < exp(-ch(-e)/x). 

Theorem 4.1. (Chernoff- Wald Bound) 

For each i = 1, 2, . . . , m, let Xu, Xi2, ... be a sequence 
of random variables such that < Xu < 1 . 
Let T £ IN + be a random variable with E[T] < 00. 
Let M = max™ x Xu + X i2 + ■ • • + JQt. Suppose 

E[X U \T>t; {Xji :j <t,l<i<m}] < fi 

for all i, t for some fi £ IR. Let e > satisfy 

e -ch(e)max{/i£;[T],i?[M]/(l + e)} < 1/m 

TTien 

£[M] < {l + e)fj,E[T}. 

The claim also holds with the following replace- 
ments: "mini" for 'hiax 4 "; "> n" for "< /x"; 
"E[M] >" for "E[M] <"; and, for each "e", "-e" (ex- 
cept in "e > "). 

In the case when T is constant, the Chernoff bound 
implies Pr[M > (1 + f)^T\ < 1 for e as above. 

The first-time reader can skip the following proof. 



z ^ 0. Thus, conditioned on the event T > t and the 
values of Yi ; t-i(l < i < to), 



Proof. For t = 0, . . . , T, let Y« = X a + X a 
(i = 1, . . . , m) and 



X,, 



Z t =log 1+e ^(l + e) Y -. 

z=l 

Note that for each i, Z T > log 1+£ (l + e) y * T = Y zT . Thus 
M < Zt- We use Wald's inequality to bound E[Zt]- 
Fix any t > 0. Let y it = (1 + e)**-*- 1 . Then 



z t - z 



t-l 



= log'i 
< logi 

= log! 



< 



Ejjto(l+f)f? 

Y., vu 
Ejiftt(i+fgj0 

^iVitXi^ 
J2i Hit 

2Ji VitXit 



ln(l + e) w, 



The first inequality follows from (1 + e) z < 1 + ez 
for < z < 1. The last inequality follows from 
log 1+e 1 + z = ln(l + z)/ln(l + e) < z/ln(l + e) for 



E\Z t - Zt-i] < 



Ei Vit A* 



ln(l + e) Y,iV* 



This implies that £7[Z t - Z t -i\T > t] < /xe/ln(l + e). 
Using Zq — log 1+e m and Wald's inequality, 

E[M] < E[Z T ] < log 1+e m + £[7>e/ln(l + e). 

By algebra, the above together with the assumption on 
e imply that E[M] < (1 + e)/i£[T]. 

The proof of the claim for the minimum is essen- 
tially the same, with each "e" replaced by "— e" and 
reversals of appropriate inequalities. In verifying this, 
note that log 1 _ e is a decreasing function. rj 

In many applications of Chernoff- Wald, each Xu 
will be independent of {Xjt : I < t, 1 < j < m}, and T 
will be a stopping time for {^Qt} — a random variable 
in IN + such for any t the event "T = t" is independent of 
{Xu : £ > t, 1 < i < to}. Then by an argument similar 
to the proof of Lemma [4.21 we have: 

Lemma 4.4. For 1 < i < m, let Xn,X i2 ,. . . be 
a sequence of random variables such that each X it is 
independent of {Xj£ : £ < t, 1 < j < m}. Let T be a 
stopping time for {Xu}. Then 
E[X lt \T>t; {X fl :j<t,l<i<m}} = E[X it ] ■ 



5 Randomized rounding for facility location. 

We use Wald's inequality to analyze a natural random- 
ized rounding scheme for facility location. 

Guarantee 5.1. Let F be the output of the facility- 
location rounding scheme in Fig. \^ given input x. Let 
Af x be the number of customers c such that x(f,c) > 0. 
Then Ep [dist(F') + cost(F)] is at most dist(a;) + 
V, cost; /;,•(/ i// A . . 

Proof. Observe the following basic facts about each 
iteration of the outer loop: 

1. The probability that a given customer c is as- 
signed to a particular facility / in this iteration 
is x(f, c)/\x\. 

2. The probability that c is assigned to some facility 
\sY.fx{f,c)/\x\>l/\x\. 

3. Given that c is assigned, the probability of it being 
assigned to a particular / is x(f,c). 
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input: fractional facility location solution x. 

output: random solution F C T s.t. Ep[dist(F) + cost^ )] < dist(x) + H A cost(x) . 

1. Repeat until all customers are assigned: 

2. Choose a single facility / at random so that Pr(/ chosen) = x(f)/\x\. 

3. For each customer c independently with probability x(s, e)/x{s): 

4. Assign (or, if c was previously assigned, reassign) c to /. 

5. Return the set containing those facilities having customers assigned to them. 



Figure 2: Facility-location rounding scheme. Note \x\ = J2 S X ( S )- 



To bound _E[dist(F)], it suffices to bound the ex- 
pected cost of the assignment chosen by the algo- 
rithm. For a given pair (/, c) , what is the prob- 
ability that / is assigned to c? By the third 
fact above, this is x(f,c). Thus, E[dist(F)] < 
E/ c Pr(c assigned to /)dist(/, c) = dist(aj). 
' 'To finish, we show Pr(/ 6 F) < H Afx x(f). 
This suffices because it implies that E[cost(F)] equals 
E/ Pr(/ G F)cost(/) < E/ H Afm x(f)cost(f). 

Fix a facility /. Call the customers c such that 
x(f, c) > the "fractional customers of /" . Let random 
variable T be the number of iterations before all these 
customers are assigned, 
claim 1: Pr(/ eF)< E[T}x(f)/\x\. 
proof: Define X t to be the indicator variable for the 
event "/ is first chosen in iteration t" . 

As E[X t \T > t] < x(f)/\x\, by Wald's inequality 
Pr(/ e F) = E[Xt +X 2 + --- + X T ]< E[T]x{f)/\x\. 
This proves the claim. 

Recall that A/ x is the number of fractional cus- 
tomers of /. To finish the proof, it suffices to show: 
claim 2: E[T] < \x\H Afx 

proof: Define ut to be the number of fractional cus- 
tomers of / not yet assigned after iteration t (0 < t < 

T). (Recall H = and Hi = 1 + 1/2H hl/i.) Then 

provided t <T, H Ut — H Ut+1 is 



u t u t - f Ut+i - 1 u t 

The expectation of the right-hand side is at least l/\x\ 
because each customer is assigned in each iteration with 
probability at least l/|x|. Since H Ut is H Af:c when 
t = and decreases by at least l/\x\ in expectation 
each iteration, by Wald's inequality, it follows that 
E[H Afj: ~ H UT ] > E[T]/\x\. Since H Ut = 0, the claim 
follows. rj 

The randomized scheme can easily be derandomized. 

Corollary 5.f . There is a polynomial-time [d+H A k]- 
approximation algorithm for uncap, facility location. 



6 Greedy algorithm for weighted fc-medians. 

The k-medians rounding scheme takes a fractional fc- 
medians solution (x, fc, d) and an e > and outputs 
a random solution F. The scheme is the same as 
the facility-location rounding scheme in Fig. |3 except 
for the termination condition. The modified algorithm 
terminates after the first iteration in which the facility 
cost exceeds fc[ln(n + n/e)] or all customers are assigned 
with assignment cost less than d(l + e). We analyze 
this rounding scheme using Wald's inequality and then 
derandomize it to obtain a greedy algorithm (Fig. yj. 

Guarantee 6.1 . Let F be the output of the weighted 
k-medians rounding scheme given input x. Then 
cost(_F) < k ln(n + n/e) + max/ cost(/) and with pos- 
itive probability dist(-F) < d{\ + e). 

Proof. The bound on cost(_F) always holds due to the 
termination condition of the algorithm. 

Let random variable T be the number of iterations 
of the rounding scheme. Let random variable Ut be 
the number of not-yet-assigned customers at the end 
of round t (0 < t < T). By fact 2 in the proof of 
Guarantee O E[u t \u t -i AT > t] = (1 - l/\x\)u t -i. 

Define random variable d t to be the total cost of the 
current (partial) assignment of customers to facilities at 
the end of round t. Because each customer is reassigned 
with probability l/\x\ in each round, it is not to hard to 
show that E[d t \ d t -\ A T >t] = (l- l/\x\)d t -i + d/\x\. 

Define random variable Ct to be the total cost of 
the facilities chosen so far at the end of round t. In each 
iteration, _E[cost(/)] = k/\x\, so that E[ct | Ct-i AT > 
t] = Ct-i + k/\x\. 

Define <f> t = Ct/k + ln[u t + (d t /d - 1 ) / (f + e)]. 
Then using lnz < z — 1 and the two facts established 
in the preceding three paragraphs, a calculation shows 
E[(f> t - (f) t -i\dt-i,u t -i,c t -i] < 0. By Wald's inequality, 
this implies E[(/)t] < 4>o < Inn. Thus, with positive 
probability, 4>t < Inn. Assuming this event occurs, we 
will show that at the end all elements are covered and 
the assignment cost is not too high. 
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input: distQ, cost(), e, d, n. 

output: Set F of facilities s.t. cost(_F) < fcln(n + n/e) + maxj cost(/) and dist(_F) < (1 + e)d. 

1. Define <j)(c, f) = dist(c, f)/d(l + e). ... c's contrib. to u t + dt/d(l + e) if c assigned to f. 

2. For each customer c do: f c <— none. ... f c is the facility c is currently assigned to 

3. Define </>(c, none) = 1. ... contribution is 1 if c is unassigned 

4. Repeat until all customers are assigned with assignment cost < d(l + e): 

5. For each facility / define Cj = {c : (f>(c, f c ) > <f>{c, /)}. 

6. Choose a facility / to maximize E c ec ^( c > / c ) ~ ^( c ' /)] / cost (/)- 

7. For each c 6 C/ do: / c <— /. ... Assign c to f . 

8. Return the set of chosen facilities. 

Figure 3: The greedy fc-medians algorithm. 



If the rounding scheme terminates because all ele- 
ments are covered and the assignment cost is less than 
(1 + e)d, then clearly the performance guarantee holds. 
Otherwise the algorithm terminates because the facility 
cost ct exceeds ln(n + n/e)k. This lower bound on the 
size and the occurrence of the event "0t < In n" imply 
that ut < 1 and dx < (1 + e)d. rj 

Next we apply the method of conditional probabil- 
ities. Let T, dt, Ut, Ct, and cj>t (0 < t < T) be defined as 
in the proof of Guarantee d, ll for the k- medians rounding 
scheme. That proof showed that E[4>t] < Inn, and that 
if <f>T < In n then F meets the performance guarantee. 

To obtain the greedy algorithm, in each iteration 
we replace the random choices by deterministic choices. 
Let dt, u t , and dt denote, respectively, the assignment 
cost, number of unassigned elements, and facility cost 
at the end of the tth iteration of the greedy algorithm 
(analogous to dt, Ut, and Ct for the randomized algo- 
rithm) . The greedy algorithm will make its choices in a 
way that maintains the invariant 

E[4>t \ dt = dt Auj = ut Act = St] < Inn. 

Note that the expectation above is with respect to the 
random experiment. That is, the invariant says that if, 
starting from the current configuration, the remaining 
choices were to be made randomly, then (in expectation) 
<fi would end up less than Inn. 

Define <f> t = c t /k + ln[u t + (d t /d- 1) / (1 + e)] (anal- 
ogous to <f>t for the randomized algorithm). The proof 
of Guarantee 16 . II easily generalizes to show E[4>t | dt — 
d t A ut — u-t A Ct = Ct] < 4>t- Thus, it suffices to main- 
tain the invariant <f> t < Inn. Since (f>o = 4>o < Inn, the 
invariant holds initially. 

During each iteration t, the algorithm chooses a 
facility / and assigns it a set of customers C so that 
4>t < <f>t—i- A calculation shows <f>t — 4>t—i is less than 



cost(/) _ u~t-i -u t + (dt-i/d- d t /d)/(\ + e) 
k u t _i + (d t _i/d-l)/(l + e) 

It suffices to choose / and C so that the above is non- 
positive. Whatever Ut-i, Ct-i, and dt-i are, if / and C 
are chosen randomly, then the expectation of the above 
is zero. Thus, there is some choice of / and C which 
makes it non-positive. Thus it suffices to choose / and 
F to maximize 

Ut-i -u t + (dt-i/d - d t /d) /(I + e) 
cost(/) 

The algorithm considers each facility /; for each /, it 
determines the best set C of customers to assign. The 
algorithm is shown in Fig. |3J 

The termination condition in the algorithm differs 
from the one in the rounding scheme, but the modified 
termination condition suffices because it follows from 
the analysis that whatever k is, the algorithm will 
terminate no later than the first iteration such that the 
facility cost exceeds fcln(n + ne). By the derivation, 

Guarantee 6.2. Given e, and d such that a fractional 
solution of cost k and assignment cost at most d ex- 
ists, the greedy weighted k-medians algorithm (Fig. 0) 
returns a solution F such that dist(_F) < (1 + e)d and 
cost(F) < fcln(n + n/e) + max/ cost(/). 

Without loss of generality (since we are approximately 
solving the IP) maxj cost(/) < k. For the unweighted 
problem, the number of iterations is 0(fcln(n/e)). No 
facility is chosen twice, so the number of iterations is 
always at most m, the number of facilities. 

Corollary 6.1. Let < e < 1. The weighted k- 
medians problem has a [(1 + e)d; (1 + ln(n + n/e)k)]- 
approximation algorithm that runs in 0(m) linear-time 
iterations, or 0(k ln(n/e)) iterations for the unweighted 
problem. 
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input: fractional k-medians solution (x*,k,d), < e < 1. 




output: random fractional solution x s.t. cost(x) = (1 — t)~ 1 k 


and £[dist(x)] < (1 - e)~ 2 d. 


1. Choose N > ln(rc/e)/ch(— e) s.t. iVfc is an integer. 


... i?eca£Z ch(e) > eln(l + e). 


2. For each f,c do: x(/) «— &(/, c) <— cc(c) <— 0. 




3. Repeat N k times: 




4. Choose a single facility / at random so that Pr(/ chosen) 


= x*(f)/k. 


5. Increment x{f). 




6. For each customer c, with probability x*(f, c)/x*(f) do: 




7. Increment x(f,c) and x(c). 




8. Return x, where x = x j (1 — e)iV. 





Figure 4: Fractional fc-medians rounding scheme. 



7 Lagrangian relaxation for k- medians. 

In this section we derive and analyze a Lagrangian- 
relaxation, [(1 + e) 2 d; (1 + e)k] -approximation algorithm 
for the fractional unweighted fc-medians problem. The 
rounding scheme is shown in Fig. 21 We use the Chernoff 
and Markov bounds to bound the probability of failure. 

Guarantee 7.1. Let x be the output of the fractional 
k-medians rounding scheme. Then with positive prob- 
ability x has cost(ir) < (1 — e)~ 1 k and dist(ai) < 
(l-e)- 2 d. 

Proof. Recall k = cost(x*) = \x*\ and d — dist(x*). 
The bound on the cost always holds, because each of 
the kN iterations adds 1 to cost(cc). It remains to show 
that with positive probability, after the final iteration, 
dist(x) < (1 - e)- 1 Nd and each x{c) > (1 - e)N . 

Since each iteration increases dist(x) by d/k in 
expectation, finally E[dist(x)] < (Nk)d/k = dN. By 
the Markov bound, Pr[dist(x) > dN/(l - e)] < 1 - e. 

For any customer c, x{c) is the sum of kN indepen- 
dent 0-1 random variables each with expectation at least 
1/k, so by the Chernoff bound, Pr[x(c) < (1 — c)N] < 
exp(— ch(— e)N), which is at most e/n by the choice of 
N. 

By the naive union bound, Pr[dist(a;) > dN/(\ — 
e) V min c x(c) < (1 - e)N] < (1 - e) + n(e/n) = 1. n 

Next we sketch how the method of conditional 
expectations yields the algorithm shown in Fig. The 
proof of Guarantee l7. ll implicitlv bounds the probability 
of failure by the expectation of 

distp) x ^ (1- e) x ^ 
dN/(l-e) + ^ (l-e)(i-^) JV 

and it shows that the expectation is less than 1. An 
upper bound (called a "pessimistic estimator" Jl]) of 



the conditional expectation of the final value of the 
above, given the current value of x and the number t 
of remaining iterations, is 

g dist(x) +td/k ^ (1 - e )x(c) e -te/k 

{X,t) dN/(l-e) + ^ (l-e)(i-^ 

The algorithm chooses / and C in each iteration in order 
to minimize the increase in the above quantity, which 
consequently stays less than 1. 

Guarantee 7.2. The fractional k-medians algorithm 
in Fig. returns a fractional solution x having 
cost(x) < (1 — e) _1 fc and dist(x) < (1 — e)~ 2 d. 

Proof. (Sketch.) Let $ be as defined above. The 
algorithm maintains the invariant that with t iterations 
remaining, $(a;,i) < 1. It is straightforward to verify 
that the invariant is initially true, and that if is true 
at the end, then the performance guarantee is met. We 
verify that the invariant is maintained at each step. The 
increase in $ in a single iteration is proportional to 

]T dist(/, c) - + £ „( c ) e «/*(i -e[ceC])-^ y(c) 
cec c c 

where y(c) = a (1 — e) x ^ before the iteration for 
suitably chosen scalar a > 0. If / and C were chosen 
randomly as in the rounding scheme, the expectation of 
the above would be at most 0. The choice made by the 
algorithm minimizes the above quantity, therefore the 
algorithm maintains the invariant. rj 

Corollary 7.1. Let < e < 1. The fractional k- 
medians problem has a [(l + e)d, (1 + e)k\- approximation 
algorithm that runs in 0(fcln(n/e)/e 2 ) linear-time iter- 
ations. 
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input: fractional k-medians instance, < e < 1, k, d. 

output: fractional solution x s.t. cost(x) < (1 — e)~ l k, and dist(:r) < (1 — e)~ 2 d. 

1. Choose N > ln(n/e)/ch(— e) such that N k is an integer. ... Recall ch(— e) > e 2 /2. 

2. For each f,c do: x(f) <- x(f, c) <- x(c) <- 0; y{c) = e(l - e)- 1 Nde ch{ ^ N . 

3. Repeat Nk times: 

4. For each c do: set y(c) <— y(c) e £ / fc . 

5. Choose a single facility / and a set of customers C to maximize X^cec 2/( c ) — dist(/, c )- 

6. Increment x(/). 

7. For each c £ C do: Increment x(/, c) and x(c), and set j/(c) <— (1 — e)y(c). 

8. Return £, where x — x / (1 — e)iV\ 



Figure 5: Lagrangian-relaxation algorithm for fractional fc-medians. In step 4, it suffices to choose the best set of the 
form C = {c : y(c) > dist(/. c)} for some /. This can be done in linear time. 



8 Lagrangian relaxation for facility location. 

The rounding scheme for fractional facility location 
is the same as the rounding scheme for fractional fc- 
medians in Fig. 01 except for the termination condition. 
The modified algorithm terminates after the first itera- 
tion where each x{c) > (1 — e)N , where N is chosen to 
be at least ln(n)/ch(— e) s.t. (1 — e)N is an integer. We 
use the Chernoff-Wald bound to analyze the scheme. 

Guarantee 8.1. Let x be the output of the fractional 
facility-location rounding scheme as described above. 
Then x is a fractional solution to the facility location 
LP and E[cost(x) + dist(x)] < (1 - e) _1 (d + k). 

Proof. The termination condition ensures that all cus- 
tomers are adequately covered. It remains to bound 
E[cost(x) + dist(x)]. 

Let r.v. T denote the number of iterations of the 
rounding scheme. In each iteration, the expected in- 
crease in cost(x)+dist(j:) is k/\x*\ J rd/\x*\. By Wald's 
inequality, at termination, E[cost(x) + dist(x)] < 
E[T]{k + d)/\x*\. It remains to show E[T] < N\x*\. 
(Recall that x = x/(l ~ e)N.) 

For any customer c, the probability that x(c) is 
incremented in a given iteration is at least l/|x*|, 
independently of the previous iterations. Let r.v. M = 
min c x(c) at the end. Note that, by the choice of N, in 
fact M = (1 - e)N. 

By the Chernoff-Wald bound, (1 - e)N = E[M] > 
(l-e)E[T]/\x*\ (provided ch(-e) > ]n(m)(l-e)/E[M], 
which indeed holds by the choice of N). Rewriting gives 
E[T]<N\x*\. n 

Next we sketch how applying the method of con- 
ditional expectations gives the Lagrangian-relaxation 
algorithm shown in Fig. Below we let Xj denote 



the value of x after the final iteration of the algo- 
rithm and x denote the value at the "current" itera- 
tion. The analysis of the rounding scheme shows that 
£ , [cost(cc/) + dist(x/)] < N (k + d). The conditional 
expectation of cost(x/) + dist(ir/) at the end, given 
the current x, is cost (a;) + dist(x) + i?[£|x](fc + d)/\x*\, 
where random variable t is the number of iterations left. 

The proof of Chernoff-Wald, in this context, ar- 
gues that the quantity M{x) l°gi- e X} c (l — e) x ^ is 
log 1 _ e n initially, at most (1 — e)JV finally, and decreases 
in expectation at least —e/(\x* \ ln(l — e)) in each iter- 
ation. An easy generalization of the argument shows 
-E[i]x] is at most ^T^^j^n^y) ■ This gives us our pes- 
simistic estimator: E[cost(xf) + dist(xj) | x] < 
where 



$(a?) = cost(a;) + dist(x) + (k + d) 



(1 - e)N - M(x) 
-e/ln(l-e) ' 



The algorithm chooses / and C to keep $ from increas- 
ing (although not necessarily to minimize $) at each 
round. 

Guarantee 8.2. Let x be the output of the algorithm 
shown in Fig. Then x is a fractional solution to 
the facility location LP and cost(x) + dist(x) < (1 — 
e) _1 mm^ dist(a;) + cost(x), where x ranges over all 
fractional solutions. 

Proof. (Sketch) Define $ as above. The algorithm 
maintains the invariant $(x) < (k+d)N. A calculation 3 
shows that the invariant is initially true by the choice 
of N. Clearly if the invariant is true at the end the 



a This proof is an adaptation of part of the Chernoff-Wald proof 
to this context. For further details on parts marked with this 
footnote, see that proof. 
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input: fractional facility-location instance, < e < 1. 

output: fractional solution x s.t. cost(x) + dist(x) < (1 — e)^ 1 (d + k). 

1. Choose N > ln(n)/ch(— e) such that N(l — e) is an integer. ... Recall ch(— e) > e 2 /2. 

2. For each f,c do: x(f) <— x(f, c) «— x(c) <— 0; y(c) <— 1. 

3. Repeat until each x(c) > (1 — e)7V: ^ 

4. Choose a single facility / and a set of customers C to maximize ' " ; z< 



5. Increment *(/). C0St(/) + dist (/> c ) ' 

6. For each c £ C do: 

7. Increment x(/, c) and x(c), and set y(c) <— (1 — e)y(c). If x(c) > iV set y(c) <— 0. 

8. Return i, where x = x / (X — e)N. 



Figure 6: Lagrangian-relaxation algorithm for fractional facility location. In step 4, it suffices to choose the best set 
of the form C = {c : y(c)/dist(/, c) > A} for some A. 



performance guarantee holds. In a given iteration, the 
increase in $ is at most 3 (k + d)e times 

cost(/) +^ egc dist(/, c) _ J2cecy( c ) 
k + d E c 2/(c) 

where y(c) = (1 — e) xlyC \ If / and C were chosen 
randomly as in the rounding scheme, the expectation 
of the above quantity would be non-positive. 3 Thus, 
to keep it non-positive, it suffices to choose / and c to 
maximize 

I2cecy( c ) 

cost(/) + E ce c dist (/> c ) 

which is what the algorithm does. rj 

In each iteration, at least one customer c with 
x(c) < N has x{c) incremented. Thus the number 
of iterations is 0(nm(n)/e 2 ). Each iteration can be 
implemented in linear times 0(ln(n)) time. Thus, 

COROLLARY 8.1. Let < e < 1. Fractional facility 
location has a [(1 + e)(d + k)]- approximation algorithm 
that runs in 0(nln(n)/e 2 ) iterations, each requiring 
time linear in the input size times Inn. 

9 Further directions 

Is there a greedy [d + H&k\ -approximation algorithm 
for facility location? A [(1 + e)d; (1 + ln(n + n/e))k]- 
approximation algorithm for weighted fc-medians? A 
[d, (1 + e)k]- or [(1 + e)d, k] -approximation algorithm 
for fractional /c-medians? A Lagrangian-relaxation al- 
gorithm for fractional weighted fc-medians? 

The running times of all of the algorithms here 
can probably be improved using techniques similar to 
the one that Fleischer applied to improve Garg and 



Konemann's multicommodity flow algorithm 0], or 
(depending on the application) using standard data 
structures. 

In practice, changing the objective function of the 
LP relaxation of the IP to better reflect the performance 
guarantee might be worthwhile. For example, if one 
is going to randomly round a fractional solution x to 
the facility-location LP, it might be better to minimize 
dist(x) + cost(/)x(/)7?A / rather than dist(a;) + 
cost(x). This gives a performance guarantee that is 
provably as good, and may allow the LP to compensate 
for the fact that the difficulty of approximating the 
various components of the cost varies. 
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A if -medians via fc-set cover via PST. 

For completeness, we discuss a relation between fractional 
fc-medians and the mixed/packing covering framework of 
Plotkin, Shmoys, and Tardos (PST) [T3]. First we consider 
the fc-set cover problem — a variant of weighted set cover in 
which each set s is given a "distance" dist(s) £ IR + , and the 
goal is to choose a cover (a collection of sets containing all 
elements) of size at most fc, minimizing the total distance. 

We formulate the decision problem (given d, is there 
a set cover of size at most fc and distance at most d?) 
as a mixed packing/covering problem. Let P = {x : 

Xs — K}. For x £ P define x e = ^2 33e x s and 
dist(x) = ^2 x s dist(s). Then the fractional K-set cover 
problem is the packing/covering problem 3?x £ P : (Ve)a; e > 
1; dist(:r) < d. 

We can solve this using the PST algorithm as follows. 



The input to that algorithm is (distQ, fc, d, e). We can 
use it to compute an approximate solution x such that 
(ye)x e > 1 — e and dist(a;) < (1 + e)d (provided the original 
problem is feasible). We scale x, multiplying it by 1 + O(e), 
to get the final output. 

With care, we can show that to implement the PST 
algorithm, it suffices to have a subroutine that, given a vector 
a, returns x £ P minimizing dist(z) — a e x e . An optimal 
x can be found by enumerating the sets and choosing the set 
s that minimizes dist(s) — X^ess ae ' 

The running time of the PST algorithm is dominated by 
the time spent in this subroutine. The subroutine is called 
0(p ln(m) /e 2 ) times, where m is the number of elements and 
p is the width of the problem instance, which in this case is 
fc max a {dist (s)/<i, 1}. Thus, 

Corollary A.l. The fractional k-set cover decision prob- 
lem reduces to a mixed packing /covering problem of width 
p — k max a {dist(s)/d, 1}. If a problem instance is feasi- 
ble, the algorithm of \13H yields a fractional solution x with 
\x\ < (1 + e)fc and dist(s) < (1 + e)d in time linear in the 
input size times 0(pln(m)/e 2 ) . 

In many cases, we can assume without loss of generality 
that max s dist(s) < d, in which case the width is fc. 
Except for the fact that this is a decision procedure, this is 
comparable to Corollary 17. II (Although that bound requires 
no assumption about d.) 

Next we sketch how weighted fc-medians reduces to fc- 
set cover. We adapt Hochbaum's facility-location-to-set- 
cover reduction. Fix a weighted fc-medians instance with 
n facilities and m customers C. Construct an (exponentially 
large) family of sets as follows. For each facility / and subset 
C of customers, define a set Sfc = C, with dist(S'/c) = 
^ cgC dist(/, c) and cost(S/c) = cos "t(/). Then each fc- 
medians solution corresponds to a fc-set cover, and vice versa. 
The bijection preserves dist, and extends naturally to the 
fractional problems as well. 

Even though the resulting fractional fc-set is exponen- 
tially large, we can still solve it efficiently using PST pro- 
vided we have a subroutine that, given a vector a, effi- 
ciently finds a facility / and set of customers C minimizing 
^2 ceC dist(/, c) — q c . This C and / can in fact be found by 
choosing the facility / minimizing ^2 cGC dist(/, c) — a c , 
where Cj — {c : dist(s) < a c .}. Thus, we have 

Corollary A. 2. The fractional weighted k-medians deci- 
sion problem reduces to a mixed packing /covering problem 
of width p = fc max/{^ c dist(/, c)/d, 1}. If a problem in- 
stance is feasible, the algorithm of yields a fractional 
solution x with \x\ < (l + e)fc and dist(x) < (l + e)dintime 
linear in the input size times 0(pln(m)/e 2 ). 

If each dist(/, c) < d, then p < km, where m is the number 
of customers. This bound on the running time is a factor of 
m worse than the bound in Corollarv l7.1l (though a reduction 
yielding smaller width may be possible). 



